A Combined Extractive With Abstractive Model for Summarization
نویسندگان
چکیده
Aiming at the difficulties in document-level summarization, this paper presents a two-stage, extractive and then abstractive summarization model. In first stage, we extract important sentences by combining similarity matrix (only used for time) or pseudo-title, which takes full account of features (such as sentence position, paragraph more.). To coarse-grained from document, considers differentiation most document. The second stage is abstractive, use beam search algorithm to restructure rewrite these syntactic blocks extracted sentences. Newly generated summary serves pseudo-summary next round. Globally optimal pseudo-title acts final summarization. Extensive experiments have been performed on corresponding data set, results show our model can obtain better results.
منابع مشابه
From Extractive to Abstractive Summarization: A Journey
The availability of large documentsummary corpora have opened up new possibilities for using statistical text generation techniques for abstractive summarization. Progress in Extractive text summarization has become stagnant for a while now and in this work we compare the two possible alternates to it. We present an argument in favor of abstractive summarization compared to an ensemble of extra...
متن کاملExtractive and Abstractive Event Summarization over Streaming Web Text
During crises, information is critical for responders and victims. When the event is significant, as in the case of hurricane Sandy, the amount of content produced by traditional news outlets, relief organizations, and social media vastly overwhelms those trying to monitor the situation. The ensuing digital overload that accompanies large scale disasters suggests an opportunity for automatic su...
متن کاملA Deep Reinforced Model for Abstractive Summarization
Attentional, RNN-based encoder-decoder models for abstractive summarization have achieved good performance on short input and output sequences. For longer documents and summaries however these models often include repetitive and incoherent phrases. We introduce a neural network model with a novel intraattention that attends over the input and continuously generated output separately, and a new ...
متن کاملPolytope Model for Extractive Summarization
The problem of text summarization for a collection of documents is defined as the problem of selecting a small subset of sentences so that the contents and meaning of the original document set are preserved in the best possible way. In this paper we present a linear model for the problem of text summarization, where we strive to obtain a summary that preserves the information coverage as much a...
متن کاملA Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat Summarization
In this paper we report our effort to construct the first ever Indonesian corpora for chat summarization. Specifically, we utilized documents of multi-participant chat from a well known online instant messaging application, WhatsApp. We construct the gold standard by asking three native speakers to manually summarize 300 chat sections (152 of them contain images). As result, three reference sum...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2021
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2021.3066484